RAW SEQUENCE LISTING DATE: 07/22/97

PATENT APPLICATION US/08/808,031 TIME: 10:54:10 

INPUT SET: S191S8.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 




SEQUENCE LISTING 

(1) GENERAL INFORMATION:

(i) APPLICANT: Inouye, Sumiko 
Hsu, Mei-Yin 
Eagle, Susan 
Inouye, Masayori 

(ii) TITLE OF INVENTION: Prokaryotic Reverse Transcriptase 
(iii) NUMBER OF SEQUENCES: 45 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Weiser & Associates 

(B) STREET: 230 South Fifteenth Street, Suite 500 

(C) CITY: Philadelphia 

(D) STATE: Pennsylvania 

(E) COUNTRY: U.S.A. 

(F) ZIP: 19102 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/808,031 

(B) FILING DATE: 03-MAR-1997 

(C) CLASSIFICATION: 435 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/269,118 

(B) FILING DATE: 30-JUN-1994 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Weiser, Gerard J. 

(B) REGISTRATION NUMBER: 19,763

(C) REFERENCE/DOCKET NUMBER: 377.5888P

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-875-8383 

(B) TELEFAX: 215-875-8394 




(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 640..2094

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TCATCCGCGC GGACACCCCC TCCTACGTGC CCCCCGACGC GGAGAGCGGC GTGGAGACGG 60

TGTACCGCGT TTCCCTGGAT GGTCACCTGG TGGCGGTGGA GTGGGGCCCG CGCACGGGCT 120

CGCCGCGTCA CCAGCGGCTC TGGTTCGACT CGGATGCGGA AGCCCCCGGA GCCTACTTCG 180

CGCGCCTCGA GAAGTTGGCG GCTGACGGCT ACATCGACGC GGCCTCGGCA TTGGTCTAAA 240

CCCTTCAACC ACGGCTCGGC CGCCACGCGC GGCCGGCAGG ACAGGTGCGA CGAACAGACG 300

ACGACGTGCG CTTCACGCGC GAGCAGCCGA GAGAGGTCCG GAGTGCATCA GCCTGAGCGC 360

CTCGAGCGGC GGAGCGGCGT TGCGCCGCTC CGGTTGGAAT GCAGGACACT CTCCGCAAGG 420

TAGCCTGTTC TTGGCTCTCT CCCTCCTAGG CACTACGGCC AGGGTGGGTA GCGGAGCCAA 480

CGACGCCACC GCCGTTTACC CACCCCGGCC GTAGTGCCTA GGAGGGGAGA GCCGGTGAGG 540

CTACCGTGCC CCAGGTAAGA TGGTGGTGCT TTCCCGGCCT CCGTCGACTG CTCGCGCCAT 600

GTCCCGTCTT CCATCGCCGC GCCCGCCCAA GGTGCAGAC ATG ACC GCC AGG CTG 654
                                           Met Thr Ala Arg Leu
                                             1               5

GAC CCG TTC GTC CCC GCA GCT TCG CCG CAG GCC GTG CCC ACG CCC GAG 702
Asp Pro Phe Val Pro Ala Ala Ser Pro Gln Ala Val Pro Thr Pro Glu
                 10                  15                  20

CTC ACC GCT CCG TCG TCA GAC GCG GCC GCG AAG CGT GAA GCC CGC CGG 750
Leu Thr Ala Pro Ser Ser Asp Ala Ala Ala Lys Arg Glu Ala Arg Arg
             25                  30                  35

CTC GCG CAC GAA GCG TTG CTC GTC CGff GCG AAG GCC ATC GAC GAA GCG 798
Leu Ala His Glu Ala Leu Leu Val Arg Ala Lys Ala Ile Asp Glu Ala
         40                  45                  50

GGC GGC GCC GAC GAC TGG GTG CAG GCG CAG CTC GTC TCC AAG GGG CTC 846






Gly Gly
55

GCG GTC
Ala Val
70

GCC TGG
Ala Trp

AAG CGT
Lys Arg

GGC GCG
Gly Ala

GTG CCC
Val Pro
135

GAG TCC
Asp Ser
150

CTC CGC
Leu Arg

GTG AGC
Val Ser

TCC CCC
Ser Pro

GTC GTG
Val Val
215

GGA CGC
Gly Arg
230

GTG GTC
Val Val

CGG GTG
Arg Val

Asp Asp Trp 



GGC 
Gly 
120 

CAC 
His 



GCG 
Ala 



TGG 
Trp 



GAC 
Asp 



GAG 
Glu 



GCG 
Ala 
105 

GTG 
Val 



CGC 
Arg 



GAG 
Glu 



TTC 
Phe 



AAG 
Lys 
200 

GAG 
Glu 



TCC 
Ser 



ACC 
Thr 
185 

CCT 
Pro 



CGG 
Arg 



ATC 
Ile



GTG 
Val 



GGC 
Gly 



CTG 
Leu 



AAG 
Lys 
90 

CAC 
His 



CAC 
His 



GAG 
Glu 



GCG 
Ala 



GCG 
Ala 
170 

ATT 
Ile



GAG 
Glu 



CTG 
Leu 



CTC 
Leu 



GAC 
Asp 
250 

CTG 
Leu 



GAC 
Asp 
75 

AAG 
Lys 



GAG 
Glu 



TGG 
Trp 



GAG 
Glu 



CTG 
Leu 
155 

TTC 
Phe 



CCG 
Pro 



CTG 
Leu 



CCG 
Pro 



ACC 
Thr 
235 

CTC 
Leu 



TTG 
Leu 



Val 
60 

TTC 
Phe 



AAG 
Lys 



GCG 
Ala 



GCG 
Ala 



CGC 
Arg 
140 

GCC 
Ala 



CAC 
His 



AAG 
Lys 



AAG 
Lys 



GTC 
Val 
220 

AAC 
Asn 



AAG 
Lys 

CGC 
Arg 



Gln Ala Gln



TCC 
Ser 



GCC 
Ala 



TGG 
Trp 



GAG 
Glu 
125 

GCC 
Ala 



AAG 

Lys 



CGG 
Arg 



CGG 
Arg 



GCA 
Ala 
205 

CAC 
His 



GCG 
Ala 



GAC 
Asp 



AAG 
Lys 



AGC GCC 
Ser Ala 



GAG GCC 
Glu Ala 
95 

AAG GCC 
Lys Ala 
110 

GAC CGC 
Asp Arg 



CGG GCC 
Arg Ala 



GCG CTG 
Ala Leu 



GAG GTC 
Glu Val 
175 

GAC GGC 
Asp Gly 
190 

GCG CAG 
Ala Gln



GGC GCC 
Gly Ala 



CTG GCC 
Leu Ala 



TTC TTC 
Phe Phe 
255 

GGC GGC 
Gly Gly 



Leu Val Ser 
65 

TCC GAG AAG 
Ser Glu Lys 
80 

ACC GAG CGC 
Thr Glu Arg 



ACG CAC GTG 
Thr His Val 



CTG GCC GAC 
Leu Ala Asp 
130 

AAC GGC CTG 
Asn Gly Leu 
145 

GGG CTG AGC 
Gly Leu Ser 
160 

GAC ACG GCC 
Asp Thr Ala 



AGC AAG CGC 
Ser Lys Arg 



CGC TGG GTG 
Arg Trp Val 
210 

GCC CAC GGC 
Ala His Gly 
225 

CAC CAG GGC 
His Gln Gly
240 

CCC TCC GTC 
Pro Ser Val 



CTG CGG GAG 
Leu Arg Glu 



Lys Gly Leu 



GAC AAG AAG 
Asp Lys Lys 
85 

CGC GCG CTG 
Arg Ala Leu 
100 

GGC CAC CTG 
Gly His Leu 
115 

GCG TTC GAC 
Ala Phe Asp 



ACG GAG CTG 
Thr Glu Leu 



GTC TCC AAG 
Val Ser Lys 
165 

ACG CAC TAC
Thr His Tyr 
180 

ACG ATT ACG 
Thr Ile Thr
195 

CTG TCC AAC 
Leu Ser Asn 



TTC GTG GCG 
Phe Val Ala 



GCG GAC GTC 
Ala Asp Val 
245 

ACC TGG CGC 
Thr Trp Arg 
260 

GGC ACG TCC 
Gly Thr Ser 



894 



942 



990 



1038 



1086 



1134 



1182 



1230 



1278 



1326 



1374 



1422 



1470 



ACG CTG CTG 
Thr Leu Leu 
280 

TTC CGC GGC 
Phe Arg Gly 
295 

CAG GGC GCC 
Gln Gly Ala
310 

CTC GAG AAG 
Leu Asp Lys 



ACG CGC TAC
Thr Arg Tyr 



265 

TCC CTC 
Ser Leu 



AAG CTC 
Lys Leu 



CCC ACG 
Pro Thr 



CCC AAG CCG 
Pro Lys Pro 
360 

CGC GTC CAG 
Arg Val Gln
375 

AAG ACG CGC 
Lys Thr Arg 
390 

GTC GTG AAT 
Val Val Asn 



GAC GTC GTC 
Asp Val Val 



AAG CCG GGC 
Lys Pro Gly 
440 

GCC TTC ATC
Ala Phe Ile
455 

CAG CTC ACG 
Gln Leu Thr
470 



CGG CTG 
Arg Leu 
330 

GCG GAC 
Ala Asp 
345 

CGG CGG 
Arg Arg 



CTC TCC ACG 
Leu Ser Thr 
285 

CTG CAC GTC 
Leu His Val 
300 

TCG CCC GGC 
Ser Pro Gly 
315 

TCC GCC CTC 
Ser Ala Leu 



270 

GAA GCG 
Glu Ala 



275 



GCC AAG 
Ala Lys 



ATC ACC 
Ile Thr



GCG 
Ala 



GAA GTG 
Glu Val 



GTC GCC 
Val Ala 



GCG GCG 
Ala Ala 
410 

CGC CAG 
Arg Gln
425 

CGC GAG 
Arg Glu 



CAC ATG 
His Met 



GAG CTC 
Glu Leu 



GAC CTG ACC 
Asp Leu Thr 



ACG CAG CGT 
Thr Gln Arg
365 

GTG GAG GCG 
Val Glu Ala 
380 

CGC AAG GGC 
Arg Lys Gly 
395 

GGC AAG GAC 
Gly Lys Asp 



TTC 
Phe 
350 

CCC 
Pro 



AAG 
Lys 
335 

TCC 
Ser 



CCG CGG GAG 
Pro Arg Glu 
290 

GGC CCG CGC 
Gly Pro Arg 
305 

AAC GCG CTC 
Asn Ala Leu 
320 

CGG CTG GGC 
Arg Leu Gly 



TGG ACG AAG 
Trp Thr Lys 



CCC 
Pro 



CTC CGC GCC 
Leu Arg Ala 



GGC GAG TCG 
Gly Glu Ser 
445 

ACG GAC CCG 
Thr Asp Pro 
460 

GAG TCC ACG 
Glu Ser Thr 
475 



GAG GGC 
Glu Gly 



ACG CGG 
Thr Arg 



GCG CCC 
Ala Pro 
415 

GCC ATC 
Ala Ile
430 

CTC GAG 
Leu Glu 



GTC GCG GTC 
Val Ala Val 
370 

TTC CGC GTG 
Phe Arg Val 
385 

CAG CGG GTC 
Gln Arg Val
400 

GCG GCC CGA 
Ala Ala Arg 



CAC AAC CGG 
His Asn Arg 



GCG GTC CAG 1518 
Ala Val Gln



GCC CTG CCC 1566 
Ala Leu Pro 



TGC CTG AAG 1614 
Cys Leu Lys 
325 

TTC ACC TAC 1662
Phe Thr Tyr 
340 

GCG AAG CAG 1710 

Ala Lys Gln

355 

CTC CTG TCT 1758 
Leu Leu Ser 



GCC AAG 
Ala Lys 



GCG AGC 
Ala Ser 



CAG CTC AAG 
Gln Leu Lys
450 

GGC CGC GCC 
Gly Arg Ala 
465 

GCC GCT CCG 
Ala Ala Pro 
480 



CAC CCG GAC 1806 
His Pro Asp 



ACC GGG CTC 1854 
Thr Gly Leu 
405 

GTC CCG CGC 1902 
Val Pro Arg 
420 

AAG AAG GGC 1950 

Lys Lys Gly 

435 

GGC ATG GCC 1998 
Gly Met Ala 



TTC CTG GCT 2046 
Phe Leu Ala 



CAG GCG GAG 2094 
Gln Ala Glu
485 






TGACGCTCAG CGCGCGTCCG TCGCCGACGT GCCGCGCGCC AGCAACGCCG CATTCAGCAA 2154

CTCCGTCAGC CGGCGCGGGT AC 2176


(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 263 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:


Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val Lys Gln Trp Pro
  1               5                  10                  15

Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile Cys Thr Glu Met
             20                  25                  30

Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu Asn Pro Tyr Asn
         35                  40                  45

Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys
     50                  55                  60

Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln Asp Phe Trp Glu
 65                  70                  75                  80

Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser
                 85                  90                  95

Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp
            100                 105                 110

Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro Ser Ile Asn Asn
        115                 120                 125

Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu Pro Gln Gly Trp
    130                 135                 140

Lys Gly Ser Pro Ala Ile Phe Gln Ser Ser Met Thr Lys Ile Leu Glu
145                 150                 155                 160

Pro Phe Lys Lys Gln Asn Pro Asp Ile Val Ile Tyr Gln Tyr Met Asp
                165                 170                 175

Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln His Arg Thr Lys



PAGE: 1 SEQUENCE VERIFICATION REPORT DATE: 07/22/97

PATENT APPLICATION US/08/808,031 TIME: 10:54:29 

INPUT SET: S19158.raw 



Line    Error    Original Text



